Separable Measurement Estimation of Density Matrices 
and its Fidelity Gap with Collective Protocols 
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We show that there exists a gap between the performance of separable and collective measurements 
in qubit mixed-state estimation that persists in the large sample limit. We characterize the gap 
with sharp asymptotic bounds on mean fidelity. We present an adaptive protocol that attains the 
separable-measurement bound. This protocol uses von Neumann measurements and can be easily 
implemented with current technology. 
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Collective measurements provide the largest amount 
of information that can be retrieved from a multipartite 
quantum system. However, they rarely offer much ad- 
vantage over separable (also known as unentangled) mea- 
surements as far as state discrimination Q or estimation 
based on large samples of identical copies is concerned. 
To be more precise, as the sample size N goes to infin- 
ity the (mean-squared) error in the (optimal) estimate 
often vanishes at the same rate for both collective and 
separable measurements, despite the fact that the for- 
mer are fully general whereas the latter are much more 
constrained. Examples of this abound in the literature. 
They include: estimation of qubit pure states (sep- 
arate individual measurements suffice in this case); es- 
timation of spectrum of a qudit density matrix [3j (for 
qubits see Q); discrimination of two non-orthogonal mul- 
tipartite pure states multiple-copy 2-state discrimi- 
nation [f| ; the problem of distinguishing the states of the 
double-trine ensemble 0,0; etc. (In these discrimination 
examples N needs not be asymptotically large.) 

In this letter, we meet the opposite, less common situa- 
tion, but for a very important example. (I), we present a 
state-estimation scenario where separable measurements 
are outperformed by collective measurements even in the 
large sample limit. (II), we give a protocol based solely on 
local operations and classical communication (a so-called 
LOCC protocol) whose mean-squared error (fidelity) at- 
tains the lower (upper) separable-measurement bound. 
To the best of our knowledge this is the first time a com- 
plete example of (I) and (II) is given. The example which 
we provide is of great practical relevance: it concerns the 
estimation of a qubit density matrix p — (1 + r ■ c?)/2, 
|f*| = r < 1 (the components of & are the three Pauli 
matrices) given N identical copies; i.e., assuming we are 
given the state p® N . The fact that the protocol uses 
von Neumann measurements and just one-step adaptiv- 
ity adds greatly to its practical interest, since it can be 
implemented with present-day technology. Similar pro- 
tocols have been proposed earlier in, e.g., but 



they have not been studied from the point of view of aver- 
age fidelity. Our protocol is delicately tuned to attain the 
asymptotic maximum separable-measurement fidelity. 

The estimation of qubit mixed states already showed 
some other puzzling anomalies. In |l l| it was proved 
that the average error using (local) tomography van- 
ishes as iV~ 3 / 4 , which conflicts with the behavior iV _1 
expected on statistical grounds, as well as being much 
worse than the optimal collective results 01 ■ A closer 
look at this problem reveals that this strange power law 
is intimately connected to the particular but very natu- 
ral choice of prior distribution used there. This will be 
explained, after introducing our notation. 

Our aim is to maximize the mean fidelity F = (1 + 
A)/2, where 
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In writing JIJ we have used that (a) the fidelity between p 
and its estimate p x = (1 + R x ■ a)/2 can be cast in the 

form f(p,p x ) = (tr^T^pTp^) 2 = (l+r-R x )/2, where 
the (Euclidean) 4-dimensional vector r (similar ly R x ) is 
defined as r = (yï — r 2 , r); (b) a generalized measure- 
ment on p® N is represented by a positive operator valued 
measure (POVM), O = {O x }; (c) the prior probability 
distribution of p is isotropic (dn is the rotationally invari- 
ant measure on where d — 3,2 depending respec- 

tively on whcthcr we dcal with the entirc Bloch sphere or 
only with its equatorial plane); and (d) the purity (i.e., 
r) prior distribution has probability density w(r). 

This letter focusses on measurements for which O x = 

®k=i°Xk ( so that X = XnXn-I-··Xx is a string of 
outcomes, each of them associated with one of the copies 
of p). By dcfinition, separable measurements are those 
whose components are convex combinations of these "lo- 
cal" operators. All LOCC measurements are separable, 
but not viceversa [ÏÜj . 

It is worth mcntioning that, although we do not stick 
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to any particular purity prior, many arguments favor the 
choice of the Bures distribution, 
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Notice that i s precisely the volume element induced 
by the distance (1/2) arccos/(p, p + dp) 0]. ^ i s mono- 
tonically decreasing under coarse-graining 0|. It has 
been argued that it corresponds to maximal randomness 
of the signal states and hence describes an ensemble for 
which one has minimal prior knowledge [ï^|. 

Let us summarize some known facts about the optimal 
(collective) protocols. For asymptotically largc samplcs 
the maximum value of the fidelity reads ^3] ( see a ls° 13 ) 
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where r stands for the average of the purity with respect 
to its prior (on which very mild regularity conditions need 
be assumed), i.e., f = L drw(r)r. For the Bures prior, 
Eq. ©, we have f = 2/(3tt). Similarly 0, 



F™2 = 1 - W + o(l/N), 



(4) 



which is independent of w(r). These asymptotic bounds 
wcre computed assuming no restriction upon the type of 
measurements used in the protocols. Although they were 
shown to be attained by a specific collective measure- 
ment, one cannot rule out that separable measurements 
can also attain the bounds (especially after our introduc- 
tory remarks about the freqüent asymptotic optimality 
of local protocols). 

To get a hint for this problem, we recali some re- 
sults |ll| concerning tomography, which is a Standard 
scheme for quantum state estimation in the laboratory 
at present. In the simplest approach one performs mea- 
surements of the projections of a along d fixed orthogonal 
directions (one does not make use of classical communi- 
cation), each of them on a corresponding fraction of the 
sample of size N/d. For the Bures prior the best data 
processing leads to 
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+ o(N-^ 4 ), 0<&<6, (5) 



where the specific value of the constant is irrelevant 
to this discussion. This asymptotic behavior contrasts 
drastically with © and (@J, and might lead us to sus- 
pect that there exists no local estimation protocol for 
which 1 — F ~ N^ 1 if w(r) is the Bures prior (J2J). We 
show below that this is not so. 

But let us first provide an explanation of the behavior 
shown in JSJ, which can be traced back to the contribu- 
tion to F from states near the surface of the Bloch sphere 
(almost pure states). Let S £ be the outer shell of the 



Bloch sphere of thickness e, i.e., S £ — {r : 1 — e < r < 1}. 
As an extreme case, consider the fidelity f(p,p') be- 
tween two states of S e whose Bloch vectors point in the 
same direction while one is of length r = 1, the other 
of length r' = 1 - e. We find that 1 - f(p,p') - e/2 
[instead of 1 — f(p, p') ~ e 2 , which holds for states in the 
complement of S e whose Bloch vectors are a distance e 
apart] . For a signal state not in the direction of one of the 
measurement axes, however we process the tomographic 
data x, we cannot hope to reconstruct the location and 
in particular the length of the Bloch vector to an ac- 
curacy better than N^ 1 / 2 . Thus 1 - f(p,p x ) ~ N^ 1 / 2 
when p G S N -i/2. Integrating J2J from r = 1 — e to r = 1, 
we see that the signal state has a probability p ~ e 1 / 2 of 
being in S £ . In particular, p € S N -i/2 with probabil- 
ity p ~ A" 1 / 4 . In this case, the best processing of the 
data can at most result in 1 — f(p,p x ) ~ N^ 1 / 2 . The 
(dominant) contribution to 1 — F from this increasingly 
thin outer shell of signal states, S N -i/2, is therefore of 
order [1 - f(p, p x )] Xp~ N~ 3 / 4 . 

Let us now move on to the central part of our work 
and prové the existence of a gap between the asymptotic 
hdelities of separable and collective protocols. This, in 
turn, provides an upper bound for the LOCC fidelity. 
To this end, we recali some results concerning quantum 
statistical inference theory (a comprehensive summary of 
general results can be found in [ÏJ)- Hereafter we write 
simply / for f(p,p x ). From classical statistical argu- 
ments, the average of the fidelity over the outcomes can 
be expected to be 
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tr{H(p)[I{0,O)]- 1 } + o(l/N), (6) 



where 8 = (r,9,<p) (i.e., the Standard spherical coordi- 
nates), H{6) = diag[l/(l — r 2 ),r 2 ,r 2 sin 2 9] is the quan- 
tum Fisher information matrix (QFI), and 1(9, O) is re- 
lated to the ( "classical" ) Fisher information matrix (FI) 
In{0 : O) corresponding to a measurement O on p® N 
through the equation 1(9, O) = Yíitín^oo I N (9 ,0) / N . 
(For states on the equatorial plane, d = 2, we just drop 
the </>-entry in the above expressions.) Up to a constant 
factor, H is the Riemannian mètric corresponding to the 
fidelity ■ The FI plays a similar role with respect 
to the classical fidelity (overlap) between probability dis- 
tributions. The inverse of H is a lower bound to the 
inverse of the FI which is a lower bound for the variance 
of "reasonable" estimators of 9 [ïj|. 

If one restriets oneself to separable measurements, the 
following bound holds 

tr{[H(9)]- l I(9,0)}<\, (7) 

as proved in Q . It follows straightforwardly that 

tr{H(9)[I(9,0)]- 1 }>d 2 . (8) 
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Eqs. © and (JüJ suggest that for any separable measure- 
ment scheme the following bound should apply: 



and 
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One could moreover hope that the bound remains true 
after averaging with respect to any prior. 

A direct and rigorous proof of the desired result can be 
given using exactly the same arguments as in appendix H 
of Alternatively, it follows from a general theorem 
proved in 0|. Either way, we have the inequality 
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where the second, strict, inequality, which follows 
from J3J| and (0}, proves the existence of a gap between 
the two asymptotic optimal fidelities. 

Within the so-called pointwise approach to quantum 
state estimation, the existence of a gap between op- 
timal collective and separable measurements on multi- 
parameter problems has been known for some time; see 
Hayashi and Matsumoto and their references. In this 
approach, one compares the pointwise rate of conver- 
gence (i.e., at each fixed 0), with respect to mean square 
error, of estimators satisfying regularity conditions (e.g., 
"asymptotically locally unbiased"). Differing efficiencies 
in this approach suggest, but do not prové, that a cor- 
responding gap exists when we compare average (with 
respect to a prior) fidelity of arbitrary estimators. 

We are now in a position to state precisely and prové 
our main result: there exists a (LOCC) one-step adap- 
tive protocol that saturates the separable-measurement 
bound l|10(l . The protocol which, taking inspiration from 
the Gill-Massar approach [8j, makes use of adaptivity 
(and thus of classical communication) only once, is as 
follows. In a first step, we spend a vanishing fraction, 
N a = N (1/2 < a < 1), of copies of p to get a rough 
estimate of n (9 and 0), to which we refer as uq. To this 
purpose we may use, e.g., tomography. 

In a second step we use tomography again on the re- 
maining N — Nq = N\d copies of p, but now we mea- 
sure the projection of a along fio and along d — 1 other 
orthogonal axis in the plane normal to uq. In the fol- 
lowing we refer to these axis as z, x and y respectively; 
they define a spatial reference frame related to the orig- 
inal one through a known rotation. The outcomes of 
this second step can be written as \ = {Xx, Xy> Xz) = 
(2a x — 1, 2a y — 1, 2a z — 1), where a x is the relative fre- 
quency of plusses (+) obtained in the N\ measurements 
of x • <7 {pc y and a z are defined similarly) . The estimate 
of f is given by R x = R x n x , where we have defined 
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n-y = x sin 9 cos < 
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(For d = 2 we drop the y-component of x and set = 0.) 
This protocol is similar to the one used in , where one 
was only interested in estimating the purity. The main 
difference is that, in purity estimation, after the first step 
one measures the rest of the copies along the estimated 
direction. In the case studied here, however, part of the 
copies are used to refine the estimate of the direction. 
The other main difference is in the model: purity estima- 
tion is a one-parameter model, which essentially behaves 
as a "classical" problem, and LOCC protocols do attain 
the optimal (collective) asymptotic accuracy. In contrast, 
the estimation of the whole density matrix is not classi- 
cal and collective measurements can and do provide an 
advantage. 

Let us prové that the fidelity of the protocol above 
attains the separable bound l|lUfl . The accuracy with 
which n is estimated in step one can be quantified by the 
average of C — cos O over the Nq outcomes of this first 
measurement, where O is the angle between n and its 
rough guess n . One has 
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where rj 3 (r) = 3(l/r 2 - 1/5) and rj 2 (r) = (1/r 2 - 1/4). 
A shorthand notation similar to that in Jüjl and (|14f) will 
be used below to denote other averages. E.g., (■■■)? 
will stand for the average over the prior distribution 
dp = dr w(r)dn. Likewise, the average fidelity (after step 
two) can be written as F — (f) x op = (1 + (S) x qp)/2. 
Note that the freqüències a x , a y and a z are binomially 
distributed as a x . y ~ Bin[./Vi,(l + rn x ^ y )/2] and a z ~ 
Bin[A^ l7 (1 + r C)/2], where the components of n are re- 
ferred to the rotated reference frame. Hence, for large 
Ni (N), the components of \ are close to normally dis- 
tributed; Xx, y ~ N[m x>2 /, A r 1 ~ 1 ^ 2 (l — ^nl y) 1 / 2 ] and sim- 
ilarly X z ~ N[rC, A^r V2 (l - r 2 C 2 ) 1/2 }- 

To compute the asymptotic form of A = (5} x op, we 
note that S = r R x = r-R^l-r 2 ) 1 / 2 (l-R^) 1 / 2 = S v + 
S s , as can be read off from Eq. QJ, and make in (S v ) x o 
(no average over the prior) the approximation R x ps r C, 
along with the substitutions sin 9 cos <j) — Xx/Rx ano - 
sin(9sin</> = Xy/Rx- Retaining only terms up to order 9 2 
(on average, 9 is small for large N{) we have 



(S V )xO~r 2 C 2 - 
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(For d = 2 we just drop \y) 



4 



The average (xl + X 2 y )x ( or J ust (xl)x if d = 2 ) can 
be computed trivially recalling that \x,y are almost nor- 
mally distributed random variables. The resulting ex- 
pression can be written as 

(S V ) x o = Y((l-C) 2 ) +r 2 {Oo-^l+ail/Nx), (16) 

where we have used the relation n 2 + n 2 = 1 — C 2 (n 2 = 
1-C 2 ifd = 2). 

We now observe that r 2 (C)o can be approximated 
by {rR x ) Xz o an d that the term ((1 — C) 2 )o is of order N^ 2 
[see Eq. (|14f) ]. The latter is thus subdominant if a > 1/2 
and can be dropped. Thcrefore we obtain (up to the 
order we are interested in) 

A = (rtf x + (l-r 2 )V 2 (l-4) 1/2 ) 

- ^ + °( 1 /^)· (17) 

This is a gratifying result because the term in brack- 
ets can be recognized as the average fidelity used in 
the purity estimation problem discussed in |4|, and we 
just need to borrow the asymptotic expression obtained 
there: F purity = 1 - l/(2iVi) + o(l/JVi). We have 
A = l-d/(2N 1 )+o(l/Nx) = l-d 2 /[2(iV-7V )]+o(l/iV), 
and therefore 

F d = 1 - ^ + o(l/N), (18) 

which is the separable- measurement bound (|10f> . 

Some care regarding the constant a that determines 
the (vanishing) fraction of copies used in the first step 
must be taken in the above derivation. This constant 
must be carefully tailored to the specific choice of the 
prior w(r), Eq. One can show that near r = there 
appears a term of order 0(1/N 5a / 2 ) while near r = 1 
there is a term Ojl/N 3a / 2 ) , which comes from the purity 
estimation part |4J. Therefore the choice a > 2/3 renders 
both terms subdominant. The optimal value of a is hard 
to find analytically, however numerical results suggest 
that it is close to its lower bound, 2/3. 

Let us summarize. We have analyzed LOCC estima- 
tion protocols for qubit mixed states. These are the most 
relevant arrangements for practical purposes. Using sta- 
tistical tools we have obtained an asymptotic bound on 
the fidelity for slightly more general approaches; those 
that use separable measurements. Our specific LOCC 
protocol attains the separable bound. The rate at which 
perfect determination can be attained is comparable to 
that of the completely unrestricted optimal protocol, 
which involves joint measurements: 1 — F goes to zero at 
rate iV -1 for both separable and collective approaches. 
The accuracies, however, exhibit a gap, Eq. (fTTTfl . The 
separable-measurement bounds do not depend on the 
prior distribution. In view of the fact that even opti- 
mal processing of Standard (fixed) tomography leads to 



accuracies that go to zero more slowly than iV -1 for the 
very natural choice of the Bures prior, it is nontrivial and 
gratifying to exhibit an experimentally feasible LOCC 
protocol that saturates the separable bounds (showing 
they are sharp!) and in particular has the 7V~ X rate. 
Our results can be extendcd to the distillation of purc 
states |2(|. Also they can be applied to higher dimen- 
sional systems, e.g., the pointwise approach, Eqs. (|6I10J) . 
show that the asymptotic expression of the fidelity also 
satisfies 1 - F = o(l/N). 
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